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ABSTRACT 

We study cosmic variance in deep high redshift surveys and its influence on 
the determination of the luminosity function for high redshift galaxies. For sev- 
eral survey geometries relevant for HST and JWST instruments, we characterize 
the distribution of the galaxy number counts. This is obtained by means of ana- 
lytic estimates via the two point correlation function in extended Press-Schechter 
theory as well as by using synthetic catalogs extracted from N-body cosmological 
simulations of structure formation. We adopt a simple luminosity - dark halo 
mass relation to investigate the environment effects on the fitting of the lumi- 
nosity function. We show that in addition to variations of the normalization of 
the luminosity function, a steepening of its slope is also expected in underdense 
fields, similarly to what is observed within voids in the local universe. Therefore, 
to avoid introducing artificial biases, caution must be taken when attempting to 
correct for field underdensity, such as in the case of HST UDF i-dropout sample, 
which exhibits a deficit of bright counts with respect to the average counts in 
GOODS. A public version of the cosmic variance calculator based on the two 
point correlation function integration is made available on the web. 

Subject headings: galaxies: high-redshift - galaxies: statistics - large-scale struc- 
ture of the universe 
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Introduction 



Deep high redshift observations are providing a unique insight into the early stages of 
galaxy formation, when the universe was no more than one billion years old. Large samples 
of galaxies at z > 5 exist and allow us to study with unprecedented details the early epoch 



of galaxy assembly and star formation history fe.g. seelSteidel et al 



Giayalisccliooi feouwens etaPbooi iMobasher etalhood : feeckwith etaDbood : besch et al 



1996: 



Madau et al 
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6; 



2007 [) as well as to constrain the properties of sources responsib le for reionization (e.g. see 



Stiavelli et al. 2004; Yan & Windhorst 2004; Bunker et al. 



l2006h . Hunting for high- z galax- 



ies usually follows two complementary ap proaches. One can either go f or large area surveys , 
such as GOODS JGiavalisco et all l2004h and, especially, COSMOS JScoville et all l2006h . 
where the luminosit y limit is about o r focus on small are as of the sky, such as has been 
done for the HDF Jwilliams et al.l Il996h and for the UDF faeckwith et al.l l2Q06h . In the 
latter case, the magnitude limit is 2.5 — 3 mag below (UDF limit) and the aim is primar- 
ily that of probing the faint end of the luminosity function. While this strategy has a good 
payoff in terms of galaxy detections, particularly when the slope of the luminosity function 
approaches two near the magnitude limit of the observations, the field of view of these pencil 
beam surveys is usually rather small, with the edge of the order of a few hundred arcsec- 
onds. Therefore the n umber counts are significantly influenced by cosmic variance (e.g., see 
Somerville et al.ll2004l ). For example, the surface density of i-dropouts at the GOODS depth 
found in one of the two HUDF-NICM OS parallel fields (GO 9803, PI Thompson - identified 



as HUDFP2 in iBouwens et~al1l2006h i s only one third of the average value from all the 30 



ACS fields in GOODS (lBouwens et al.ll2006l ). Even considering the much larger area of one 
GOODS field (about 10' x 16') the expected on e sigma uncertainty in the number counts for 
Lyman Break galaxies at z ~ 6 is still ~ 20% (jSomerville et al.l 120041 ) . 



Relatively little effort has been so far devoted at quantifying the impact of cosmic 
variance on the determination of properties of high redshift galaxies. Past studies have 
generally character i zed only the variance of the number counts of a given cosmic volu me 
(IMo fc Whit"elll996l : Icolombi et al.l fioool : iNewman fc Davisl I2OO2I ; ISomerville et al.ll2004h or 
studied the full distr ibution but only in numerical simulations appropriate for local galaxies 
( jSzapudi et al.ll2000l ). The impact of cosmic variance on the measurement of the shape of 
the luminosity function at high redshift has never been addressed, despite the fact that it 
is known that there is an environmental depen dence in the local universe, as M* in voids is 
about one magnitude fainter than in the field (IHoyle et all 120051 ) . Our goal is to highlight 
the problem and to provide methods for addressing it. 



In this paper we estimate the variance of the number counts using extended Press- 
Schechter analysis and we compute the distribution of number counts in synthetic surveys 
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generated by using a Monte Carlo pencil beam tracer in snapshots from cosmological N-body 
simulations. Our runs follow the evolution of dark matter particles o nly. Galaxy luminosities 
are l inked to the dark matter mass a dopting the parametric models by I Vale fc Ostrikerl (120041 ) 
and ICooray &: Milosavljeviq (120051 ). that provide quite realistic output luminosity function. 
This numerical approach allows us to investigate how the best fitting parameters of the 
luminosity function depend on the environment probed. 

The paper is organized as follows. In Sec [2] we define cosmic variance and we compute it 
using the extended Press-Schechter formalism. In sec. Owe describe the numerical framework 
that we adopted to investigate cosmic variance. In sec. 0] we apply the Monte Carlo code 
to characterize the number counts for a variety of high-z surveys strategies, while in sec. [5] 
we investigate the impact of number counts variance on the determination of the luminosity 
function, focusing in particular on the faint end slope of i-dropout galaxies in the UDF main 
and parallel fields. We give our conclusions in sec. [61 



2. Large scale structure and uncertainties in galaxy number counts 



The number counts of galaxies in a survey are affected by a combination of discrete 
sampling, observational incompleteness and large scale structure. Given a probability distri- 
bution p(N) for the number counts with mean (N) and variance (N 2 ), we define the total 
fractional error of the counts N as: 



Vr 



(N) 



The uncertainties in excess to Poisson shot noise are usually quantified in t erms of a relative 
"cosmic variance" (e.g. see 



Newman Sz David 120021 : ISomerville et al.l 12004 ) defined as: 
o (N 2 ) - (N) 2 1 



a: 



(NY 



(N) 



(2) 



To evaluate the relative cosmic variance a 2 of a given sample, two main theoretical 
approaches are possible: (i) estimation based on the two point correlation function £(r) of 
the sample or (ii) direct measurement using mock catalogs from cosmological simulations of 
structure formation. 



In the first case a 2 is derived from £(r) as follows (e.g. see iPeebled 1 19931 ) : 



<j- 



Iv Jv d 3 x 1 d 3 x 2 £(\x 1 -x 2 \) 
J v J v d 3 x 1 d 3 x 2 



(3) 
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where the integration is carried out over the volume V observed by the survey. The two point 
correlation function £(r) can either be derived from a cosm ological model for the growth 
of density perturbations (e.g. see iNewman fc David |2002| ). or from a simpler analytical 
model, such as a pow er law, with free parameters fixed by observational data (e.g. see 



Somerville et al.ll2004j ). This method has the advantage of requiring little computational 
resources, limited to the evaluation of the multidimensional integral in Eq. [3j while being 
able to handle an arbitrarily large survey volume of any geometry. The main limitation is that 
only the second moment of the counts probability distribution p(N) can be obtained. This 
method also relies on an input two point correlation function which is t ypically evaluated 
only considering linear evolution of density perturbations (see however iPeacock fc Dodds 
19961 for an analytical model of the non linear evolution of the power spectrum). In addition 
this framework assumes that all the survey volume is at a given redshift, which may not be 
appropriate for high redshift (z > 5) Lyman Break Galaxies observations, where the pencil 
beam extends for about Az « 1, an interval over which there is evidence of evolution of the 



galaxy luminosity function (e.g., see lBouwens et al.l 120061 ) . 



A direct measurement of p(N) from N-body simulations bypasses all these limitations 
but requires significant computational resources. In particular the cosmic volume simulated 
must be much larger than the survey volume, which in practice sets a limit on the survey 
area that can be adequately modeled. 

To summarize, if one is interested primarily in computing a total error budget on the 
number counts of a survey, the estimate of through integration of £(r) may be sufficient and 
requires relatively little effort. However in this paper we combine both methods, emphasizing 
especially the construction of mock catalogs from N-body simulations, as this is required to 
address the influence of cosmic variance on the uncertainty in the shape of the luminosity 
function. 



2.1. Cosmic variance in Extended Press Schechter Theory 



Following INewman fc David (120021 ) we estimate the cosmic variance in l inear theory eval- 
uating the integral of Eq. [2] using £(r) derived from the transfer function of lEisenstein fc Hu 
(119991 ). For a given redshift z, t he halo-dark mat t er bia s b(Mh) as a function of the halo 
mass (M h ) is evaluated using the lSheth fc Tormenl (119991 ) for malism. The total averag e bias 
of the sample is then computed by averaging b(Mh) over the ISheth &: Tormenl (119991 ) mass 
function down to a mass limit set by matching the desired comoving number density of halos. 



These are the basic steps to evaluate the influence of cosmic variance in the error budget 
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of the number counts using linear theory. 

• Define the survey volume and its average redshift z av . For example, for Lyman Break 
Galaxies dropouts samples, this is set by the combination of the field of view angular 
size and of the redshift window for the dropout selection. 

• Choose the intrinsic number of objects in the survey. For example, this can be done 
either starting from a specific luminosity function or estimating the number of expected 
objects from the actual number of observed objects divided by their completeness ratio. 

• Estimate the average incompleteness. Incompleteness will be close to for selections 
much brighter than the magnitude limit of the survey b ut can be in the ra nge 0.3 — 0.5 



when pushing detections up to the limit of the data (jOesch et al.l 120071 ). A precise 



estimate generally requires object recovery Monte Carlo simulations. 

Adopt a value for the average target - halo filling factor. This is in general smaller than 
1, as a specific class of objects may be visible only for a limited period of time. For 
example in the case of Lym an Break Galaxies, the duty cycle may be as low as 0.25 



^e.g. see 



Verma et alJl2007f ). 



The input information above is then used to estimate the total fractional uncertainty 
on the number counts as follows. 



Compute the minimum halo mass M m i n required to obtain the number density of halos 
hosting the survey population. Combining halo filling factor and intrinsic number of 
object s in the survey (g i ven t he survey volume) we compute the minimum halo mass 
in the ISheth fc Tormenl (119991 ) model required to match the input number density 



Compute the average bias of the sample. We calculat e the average bias of the sample 
using Press-Schechter model ( jPress fc Schechterlll974j ). 



Integrate the dark matter £(r) over the survey volume. The dark matter two point 
correlation func tion is integrated over t he pencil beam geometry of the volume (cf. 
Eq. [31 see also iNewman &: David |2002| ) to obtain the dark matter cosmic variance 



a 



dm- 



Multiply o~dm by the average galaxy bias to obtain the cosmic variance of the sample: 



a; 



b 2 o- 2 DM . 
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• Take into account Poisson noise for the number of observed objects. The total error 
budget is given by combining the contribution from cosmic variance, which is an intrin- 
sic property of the underlying galaxy population, with the observational uncertainty 
related to the actual number of observed objects N oss . Therefore the total fractional 
error (that is the one sigma uncertainty) is: 

v r = v 7 ^ 2 + l/N oss . (4) 

In Sec. H]we use this method to estimate the number counts uncertainty for typical high 
redshift surveys. 



3. Synthetic surveys 



In this section we present the numerical framework based on cosmological simulations 
that we developed to address the influence of cosmic variance on high redshift observations. 



3.1. N-body simulations 



The numerical simulations ha ve been carried out using the public version of the PM- 
Tree c ode Gadget-2 (lSDringelll2005l ). We adopt a cosmology based on the third year WMAP 
data JSpergel et alJbood l: Qa = 0.74, Q m = 0.26, H = 70 km/s/Mpc and a 8 = 0.75 (or 
cr 8 = 0.9, see later), where fl m is the total matter density in units of the critical density (p c = 
3Hq/(8ttG)) with H being the Hubble cons tant (paramet erized as H = lOO/i km/s/Mpc) 
and G the Newton's gravitational constant jPeeblesI 1 19931 ) . Q\ is the dark energy density 
and ag is the root mean squared mass fluctuation in a sphere of radius 8 Mpc/h extrapolated 
to z = using linear the ory. The initial co nditions have been generated with a code based 
on the Gr afic algorithm (IBertschingerll200ll ) using a AC DM transfer function computed via 
the fit by lEisenstein fc Hul (119991 ) with spectral index n s = 1. 



A summary of our N-body runs is presented in Table HJ We resort to various box 
sizes, optimized for different survey geometries. To characterize small area surveys such as 
a single ACS field, we use a box of edge 100 Mpc/h simulated with 512 3 particles. This 
choice gives us a volume about 73 times larger than the effective volume probed by one ACS 
field for V-dropouts (~ 5.7 x 5.7 x 420 (Mpc/h) 3 ) and about 86 times larger for i-dropouts 
(» 6 x 6 x 320 (Mpc/h) 3 ). The single particle mass is 5 • 1O 8 M / h and guarantees that the 
host halos that we consider {Mhaio <i 2 ■ 10 10 M Q /h) are resolved with at least 40 particles 
in the deepest survey that we simulate. To investigate the cosmic variance in larger area, 
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less deep surveys, such as GOODS, we use instead a larger box, of size 160 Mpc/h, also 
simulated with 512 3 particles. The total volume of the simulation is about 26 times larger 
than a single i-dropouts GOODS field (» 17.6 x 28.1 x 320 (Mpc/h) 3 ). The single particle 
mass for this run is 2 ■ 10 9 M Q /h. These simulations have <7g = 0.75. In addition, we consider 
a higher resolution simulation with more than twice the number of particles of our basic runs 
(N = 680 3 « 3.1 • 10 8 ) and with a box of edge 128Mpc/h that starts with a 8 = 0.9. This 
simulation has a single particle mass of 4.4 • 10 s M Q /h. 

Dark matter halos are identified in the simulations snap shots (saved every A z = 0.125 
in the redshift interval 10 — 4.5) using the HOP halo finder (lEisenstein &: Hutlll998l ). with 
the following parameters: the local density around each particle is constructed using a 16 
particles smoothing kernel, while for the regrouping algorithm we use S pea k = 200, S sa ddie = 
170, S ou ter = 100 and a minimum group size of 40 particles. The halo ma ss distribution in 
the sn apshots is well described (with displacements within »s 25%) by a ISheth fc Tormen 
(119991 ) mass function. One limitation that has to be taken into account when estimating the 
cosmic variance from N-body simulations is the cosmic variance of the complete simulation 
volume. For example, in our 512 3 particles, 100 Mpc/h edge box run there are 2620 dark 
matter halos at z = 6 with more than 100 particles and the differences in this number from 
run to run are larger than the nominal Poisson variance. We checked this effect by carrying 
out a control run with the same initial conditions but a different seed for the random number 
generator obtaining a difference of ~ 6%. The use of a larger box and 680 3 particles in our 
highest resolution run is expected to reduce run to run variations, but we cautiously assume 
a 10% relative uncertainty on the value of the cosmic variance that we derive through this 
paper. 

In addition, in the 680 3 simulation we also save snapshots at intervals Az = 0.5 from 
z = 10 to z = 15 in order to provide a preliminary characterization of cosmic variance in 
future JWST NIRCam surveys. 



3.2. Pencil beam tracing 



Our pencil beam tracer is similar to the one developed by iKitzbichler fc White! (120061 ). 
but it is optimized for high z surveys, allowing us in particular to take advantage of the quasi- 
constant angular distance versus redshift relation. We trace through the simulation box a 
parallelepiped where the base is a parallelogram, whose size is given by the field of view of the 
survey in comoving units, and the depth is the comoving depth associated with the redshift 
uncertainty of the selection window for Lyman Break Galaxies which we are interested in. 
This choice means that we are neglecting the variation of angular distance versus redshift in 
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the redshift interval of the selection window considered. For example, the comoving edge of 
the ACS field of view for V-dropouts is 5.5 Mpc/h at z — 4.6 and 5.9 Mpc/h at z — 5.7, and 
we approximate it with 5.7 Mpc/h. The pencil beam is traced through different snapshots 
as we swipe through its depth, and, consistently with our angular size approximation, we 
assume an average value for Az (the redshift difference between two snapshots) expressed in 
comoving distance. The choice to save snapshots at Az = 0.125 implies that a single beam 
passes through several different snapshots so that the evolution in the halo mass function 
is well captured. Az = 0.125 is equivalent to about 40 Mpc/h at z = 6 and the evolution 
in the number density of halos at the same mass scale between two adjacent snapshots is of 
the order 10 — 15%. The beam starts at a random position within the simulation volume 
and then proceeds through the cube with periodic boundary conditions, angled using the 
following choices for the two direction angles Q\ and 6 2 : 

tan 0i = 0.3 

and 

tan 62 = 0.5. 

These values have been selected to guarantee no superposition and adequate spacing for 
a typical HST-ACS Lyman Break dropouts beam as it wraps around the simulation box 
due to the periodic initial conditions. The linear correlation between the number counts of 
two nearby segments of the beam, estimated using the two point correlation function (see 
Sec. 123} is in fact < 0.02. 

Finally all the dark matter halos within the beam are flagged and saved for subsequent 
processing by the Halo Occupation Distribution part of the Monte Carlo code. A complete 
description of the properties of the pencil beam surveys simulated in this paper is given in 
Tab.d 



3.3. Halo Occupation Distribution 



Dark matter halos within a simulated field of view are populated with galaxies ac- 
cordingly to a simpl e Halo Occupation D istribution (HOD) model. We assume an average 
occupation number (jWechsler et al.ll200ll ): 



(N gal ) = 6{M - M mm )(l + (M/Mi 



(5) 



where 9{x) is the Heaviside step function, M m j n is a minimum halo mass threshold, M\ is 
the typical scale where multiple galaxies are present within the same halo and (3 > is of 
the order unity. If M > M min , one galaxy is placed at the center of the halo and then the 
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number of companions is extracted from a Poisson distribution with mean (M/Mi) 13 . Of 
these galaxies a fraction Jon is finally identified as Lyman Break Galaxies (this is to take 
into account possible observational incompleteness and/or a duty cycle where LBGs are on 
only for a fraction of their lifetime). 

We show in Fig. [1] the distribution of the number counts for i-dropouts in one ACS field 
at the UDF depth. We consider: (i) f ON = 1, Mi — > +oo (that is one galaxy per halo); (ii) 
foN — 0.5 and foN = 0.25, M\ — > +oo (that is one galaxy per halo with either 0.25 or 0.5 
detection probability); (iii) foN = 1, Mi = 5 • 10 u M Q /h and (3 = 1. In addition we also 
investigate the effect of changing the box size, the resolution and the a$ value of the N-body 
simulations using both our iV = 512 3 and our N = 680 3 runs. The value of M m , n is kept 
as free parameter and it is adjusted to have the same average number of counts in all three 
cases. The required variations of M min are limited within a factor two. The distribution of 
the number counts probability is very similar in the four cases (see table [3] for v r values) and 
shows that the characterization of the cosmic variance versus the average number counts is 
solid with respect to the details of the modeling. This is essentially due to the fact that 
by changing M m j„ by a factor of 2, the average bias of the sample varies only by less than 
15%. This is reassuring as there are large theoretical uncertainties on modeling high redshift 
galaxy formation. 



3.4. Field of view geometry 

The importance of modeling correctly the geometry of the field of view to quantify the 
cosmic variance is shown by Fig. [2J Here we focus on a single snapshot at z = 6.125 and we 
measure the total fractional error v r of number counts in a volume of 11520(Mpc//i) 3 with 
different shapes. From Fig. [2l v r is largest in the quasi-cubical volume and smallest for the 
pencil beam. Therefore cosmi c variance computed onl y from the total volume of the survey, 



assumed to be a sphere as in ISomerville et al.l (120041 ). is overestimated. This is because a 



narrow and long pencil beam probes many different environments, while a cubic volume may 
sit for a significant fraction of its volume either on under-dense or on over-dense regions. In 
all cases shown in Fig. [TJ the total error exceeds Poisson noise and the contribution from 
cosmic variance is dominant especially when the average number of counts in the volume is 
much larger than one. Fig. [2] also shows an excellent agreement between the measurement 
from cosmological simulations and the analytical estimate using the two point correlation 
function. 
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3.5. Luminosity-Mass relation 



Our numerical simulations follow the evolution of dark matter only, therefore we have to 
assume a mass-luminosity relation to investigate the influence of the cosmic variance on the 
determination of the luminosity f u nctio n. For t his we extend to higher redshift the fitting 
formulas used by IVale fc Ostrikerl (120041 ) and by ICooray fc Milosavljevid (120051 ) adopting: 



L(M) 



(M/M a 



[q+{M/M s ) cd ) l / d ' 



(6) 



and assuming M s = 10 M, 



- 4.0, q = 0.57, c = 3.57 and d = 0.23. L is conventionally 
set to 1. As the main goal of this paper is not to provide a detailed modeling of the mass 
- luminosity relation, but rather to investigate the dependence of the luminosity function 
parameters on the environment and on the fitting method adopted, the simple modeling of 
eq. [6] appears adequate. 

For a fully consistent representation of the observational selection process for Lyman 
Break Galaxies, it would be necessary to apply a luminosity cut in the apparent and not 
in the absolute magnitude. The apparent luminosity-di stance relation does in fact evolve 
quite significantly over a redshift range Az « 1 (e.g. see iBouwens et al.l 120061 Fig. 7). This 
effect leads to a reduction of the effective volume of the survey and thus to an increase of the 
cosmic variance contribution to the counts uncertainty. This is because dropouts tend to be 
preferentially selected in the low redshift corner of the selection window, even if luminosity 
evolution with redshift will partially offset this trend as galaxies at higher z appear to have a 
higher L/M ratio (see ICoorayl 120051 ) . In our standard model for number counts uncertainty 
from numerical simulations we apply a cutoff in mass (that is in absolute magnitude), which 
gives us a framework consistent with the estimates from the two point correlation function. 
To evaluate the error introduced by this assumption we have implemented a cutoff in observed 
magnitude for an ACS i-dropout sample with 40 detections on average constructed from 
our cosmological simulation at high r esolution and u s ing th e relation between observed L* 
and redshift z as plotted in Fig. 7 of IBouwens et al.l (120061 ). The resulting number counts 
distribution is plotted in Fig. [TJ The total fractional uncertainty in the number counts is 
only marginally higher (0.42 vs. 0.40) than in the case where a cutoff in absolute magnitude 
is applied. 
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Cosmic Variance 



4.1. HST surveys 

The results from our simulated distribution of number counts for different high redshift 
surveys are reported in Figs. [3]|5j For a given survey geometry we plot the fractional number 
counts uncertainty v r as a function of their average value. This facilitates the application to 
surveys at different depths. Given that the details of the Halo Occupation Model have only a 
modest influence on the value of v r (see Fig. |Tj), we resort in this paper to a reference model 
of one galaxy per halo, with f Q N = 1 (i- e - unit probability of detection) unless otherwise 
noted. In addition, we highlight the expected total fractional uncertainty on the number 
counts due to Poisson noise only. 

At the typical number counts for a field at the UDF depth the tot al fractional uncert ainty 



is v r ~ 25% for V-dropouts (assuming 100 detections per field , see lOesch et a 



35% for i-dropouts (assuming 50 detections per field, see iBeckwith et al 



200 



I. 



20071) and 



This 

is much smaller than the Poisson noise associated to the average realized number counts. 
Therefore cosmic variance is the dominant source of uncertainty for UDF-like deep fields. 
At lower number counts per field of view, that is when the survey is shallower, the Poisson 
contribution to the total fractional uncertainty v r increases (see Fig. H]) until it becomes 
dominant at the limit of zero average counts. The agreement between v r estimated using the 
two point correlation function and that measured in the numerical simulations is very good. 
The framework is also consistent with the observed clustering of high-redshift galaxies, such 
as measured by lOverzier et al.l (120061 ): our standard recipe for populating dark matter halos 
with galaxies gives an average galaxy-dark matter bias b = 5.0 for an ACS i-dropout sample 
with 60 detections on average, a value within the one sigma error bar in the measurement by 
Overzier et al.l (120061 ) . Note however that the large observational uncertainties do not allow 
a more detailed quantitative comparison. 

From the full probability distribution on the number counts, derived from our simu- 
lations, we can also evaluate the likelihood of finding a factor 2 overd ensity in galaxies at 
z = 5.9 ± 0.2 in the UDF field as reported by iMalhotra et al.l (120051 ). If we assume that 
the expected number of galaxies in that redshift interval is 7.5, the fractional uncertainty 
in the counts turns out to be v r ~ 67% and the probability of 15 or more realized counts 
in that redshift interval is greater than 10% (this has been measured taking advantage of 
the full probability distribution obtained through the simulations). Therefore the measured 
fluctuation, while lying outside the standard deviation, is not exceptional. Given that a 
i-dropouts selection wind ow contains abo u t 5 in tervals of width Az = 0.2, one overdensity 
such as that observed by IMalhotra et al.l (120051 ) will be present in a significant number of 
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random pointings. 

To characterize the number counts uncertainty in the larger area GOODS survey we 
resort to the simulation with edge 160Mpc/h, whose results are reported in Fig. Ofor the 
combination of the two North and South fields. v r is consistent with the estimate using the 
two point correlation function within a relative difference of 10% at most. Interestingly, the 
i-dropouts cosmic variance in one GOODS field is not too different from the one for a smaller 
but deeper UDF field. This is due to two effects: (i) the increased sensitivity of the UDF 
enables to detect fainter Lyman Break galaxies and thus to probe the distribution of smaller 
mass halos that are progressively less clustered, (ii) there is a significant correlation between 
the counts in adjacent fields. 

To better quantify this latter effect, we plot in Fig. [7] the linear correlation coefficient 
between the i-dropouts counts in two nearby ACS fields: 



The linear correlation has been computed using the analytical model from the two point 
correlation function. Two adjacent i-dropouts fields at the UDF depth (50 counts on average) 
have a linear correlation in the number counts of about 0.4. The correlation decreases rapidly 
as the separation increases and fields separated by more than 500 arcsec have ru n < 0.05. 
Therefore HST parallel fields, typically separated by 550 arcsec, are essentially independent 
from each other. With independent fields the variance of the total counts is given by the sum 
of the variances in each field (as the set of counts in the fields are independent variables). 

For higher redshift Lyman Break Galaxies, we plot in Fig. [5] the number counts uncer- 
tainty within both NICMOS and WFC3 fields. A single NICMOS field has such a small area 
(« 0.72 arcmin 2 ) that v r is dominated by Poisson noise up to a few counts per arcmin 2 . The 
effect of cosmic variance in the number counts uncertainty is instead more evident in a HST- 
WFC3 like field of view, where the uncertainty is significantly higher than Poisson noise. 
A very interesting conclusion that can be drawn from the figure is that the number counts 
uncertainty in the six independent main and parallel NICMOS fields, obtained through the 
HDF, HDF-South, UDF and the UDF follow-up programs, is lower than that of one deep 
WFC3 field, despite covering slightly less area. This example nicely shows that in order to 
minimize cosmic variance effects in future surveys aimed at detecting z > 7 Lyman Break 
Galaxies a sparse coverage is optimal. Of course a continuous coverage has the advantage of 
enabling other science, such as weak lensing studies at lower redshift. 



Tlin 




(7) 



where 



(Tab = ((ab - (ab)) 2 ). 
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4.2. JWST surveys 



As a preliminary characterization of cosmic variance in future JWST Lyman Break 
galaxy surveys, we present in Fig. [8] the cosmic variance for NIRCam F090W, F115W and 
F150W dropouts as given by the combination of the two nearby field of views of NIRCam. 
This has been obtained assuming no evolution from the z = 6 M/L relation (eq. ED- The light 
cone has been traced through snapshots in our 680 3 particles run, considering dark matter 
halos down to 50 particles (that is to a mass limit of 2.2 • lO lo M //i). If the same cutoff 
is applied to an ACS i-dropouts survey, then we get about 250 i-dropouts per fi eld of view 
(corresponding to a magnitude limit m^B — 29.1 using the iBouwens et al.ll2006l luminosity 
function). As can be seen from Fig. [8] the rapid evolution of the dark matter halo mass 
function greatly reduces the number of halos above the cut-off mass within the pencil beam 
at z > 10. Therefore the expected number of F150W dropouts detections per NIRCam field 
is only iV « 0.1 and these observations would be affected primarily by Poisson uncertainty. 
These numbers have been derived assuming a mass luminosity relation of dropouts at z > 6 
consistent with that of i-dropouts, as deep surveys with JWST are expected to reach about 
the z = 6 UDF sensitivity up to z = 20. Of course, it is well possible that the actual 
detections of F150W dropouts will be higher if these primordial galaxies are more luminous 
than their z ~ 6 counterparts, similarly to what happens between z = and z = 6 (e.g., see 



Coorayll2005l ). In addition, for a precise estimate of the expected number of these very high 
redshift objects in deep JWST surveys a detailed modeling of the relation between intrinsic 
and observed lu minosity is require d. At least dust reddening should however play only a 



minor effect (see lTrenti &: Stiavellill2006l ). 



4.3. Ground based narrow band surveys 

If we consider narrow band searches for high redshift galaxies we typically have a differ- 
ent beam geometr y, given by a large field of view with a small redshift depth. For example, 



Ouchi et al.l (120051 ) detect more than 500 Lyman-a emitters at z — 5.7±0.05 in a 1 deg 2 area. 
Under these conditions, we estimate v r ~ 0.18, more than 3 times the Poisson uncertainty of 
the counts. The large variance is given by a combination of a contiguous field of view with 
a small redshift interval probed, which gives a cosmic volume probed by this search roughly 
equivalent to that of a single 10 x 16 arcmin 2 i-dropouts GOODS field. 
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5. Influence of Cosmic Scatter on Luminosity Function parameters 

5.1. Bright-faint counts and environment 

The main influence of the environment probed by a deep field is on the observed number 
density of galaxies and therefore on the normalization of the luminosity function. However, 
the shape of the luminosity function can also be affected. This is the case in the local 
universe, where it has been observe d that the luminosity function in voids is steeper than in 



the field (e.g., see lHoyle et al.ll2005l ). but this possibility see ms to be neglected w hen deriving 



the luminosity function for Lyman Break galaxies (e.g., in lBouwens et al.ll2006l ). 



To highlight the importance of this effect even at high redshift, we plot in Fig. [9] the 
distribution of massive halo [M ha i > 1.2-lO n M ) counts versus total counts down to a lower 
mass halo limit [Mhaio > 3 • 1O 1O M ) for a simulate d i-dropouts survey w ith an ACS area. 
The average number of total counts is roughly at the lBouwens et al.l (120061 ) UDF depth (8.6 
counts per arcmin 2 ), while the counts for the more massive halos have an average density 
of ~ larcmin~ 2 , which is approximately the i-dropouts number density in GOODS. The 
ratio of the massive to total counts can be adopted as an estimate for the steepness (i.e. 
the shape) of the luminosity function. From the right panel of Fig. it is clear that when 
the beam passes through under-dense regions the ratio easily decreases on average by more 
than a factor two, although a large scatter is present. This shows that the shape of the mass 
(luminosity) function does indeed depend on the environment. 

To understand better the origin of this effect we consider two idealized scenarios to model 
the relation between bright and total counts, where we consider only Poisson uncertainties. 
The first (see Fig. [TO]) has been obtained by assuming that the bright counts are completely 
uncorrelated from fainter ones. Therefore we have: 



(N tot ) = (Nft) + (N br ), 



(9) 



where (Nft) (that is the average difference between the total and bright counts) depends 
on the minimum mass cut-off mass considered. When (Nft) » (N{, r ) bright and total 
counts are almost uncorrelated between each other and therefore when a field is underdense 
in bright counts, total counts are almost unaffected and its luminosity function appears on 
average (much) steeper. The luminosity function is instead (quasi) shape-invariant when we 
consider a model with a total correlation between bright and total counts, where 



(N tot )= V (N br ), 



(10) 



with rj depending again on the minimum mass cut-off mass considered. The results for this 
model are shown in Fig. [TTJ and have been obtained by first sampling the bright counts 
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(Nfyr i) from a Poisson distribution with average (N^r), and then sampling the faint counts 
Nf t i from a Poisson distribution with average p x N^ , and p = 7.75. This gives an average 
total number counts (N tot ) = 8.75(N br ). 

A realistic distribution of the counts, such as that obtained from our mock catalogs 
is shown in Fig. [9] and lies between the two extreme cases considered as toy models. In 
particular, the low Nj, r behavior of the Nb r /N tot ratio is dominated by the correlation with 
Nbr, as can be seen by comparing Fig. [HI with Fig. [TUJ This is because the large scale structure 
introduces a correlation between bright and faint counts but this correlation is not total. In 
fact deeper surveys probe smaller mass halos, whose formation probability is sensitive to 
higher frequencies in the power spectrum of primordial density perturbations than that for 
more massive halos targeted in shallower observations of the same field. 



5.2. Luminosity function shape and environment 

The results of the previous section suggest that a more thorough characterization of the 
large scale structure influence on the mass function at high redshift is required. Our main 
aim is to look for systematic variations depending on the realized number counts value and 
on the details of the fitting procedure employed. We fit a Schechter function in the form: 

4>(L)dL = 0*(L/L*) a exp{[-L/L m ])dL, (11) 

to the distribution of galaxy luminosity derived from the dark matter halo masses measured 
in our Monte Carlo code and transformed in luminosities using the prescription of Eq. O As 
discussed in Sec. 13. 5[ our treatment to build a sample of galaxy luminosities is idealized and 
we are missing many observational effects, such as apparent luminosity vs. redshift evolution 
and redshift dependent selection effects within the redshift interval considered. Our main 
aim is to highlight the importance of large scale structure in the fitting of the luminosity 
function and not to construct a detailed representation of observations. 

We start by considering, for a simulated V-dropout deep sample in one ACS field (200 
galaxies on average in the pencil beam), the distribution of the Schechter function parameter 
U and a for 4000 Monte Carlo realization^. We estimate the parameters using a standard 
Maximum Likelih ood estimator on the unbinned detections, following essentially the proce- 



dure described in ISandage et al.l (119791 ) (STJ79). For each synthetic catalog {Li\ i= i^ we 



1 We recall here that not all the 4000 realizations are truly independent as the total volume of the box is 
only about 73 times larger than the pencil beam volume for V-dropouts. 
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compute the likelihood for the luminosity function in eq: [TTJ 

£(L*,a)= J] <P(L t ), (12) 



i=lJV 



where the 0* is fixed by integrating the luminosity function up to the detection limit L min 
of the survey and imposing the normalization: 

X> 

<P(L)dL = 1. (13) 

in 

The maximization of the likelihood £(L*,a) is then carried out on a two dimensional grid 
with spacing Aa = 0.01 and AM* = 0.025, where M* = —2.5 logio(L^). The resulting 
distribution of best fitting parameters for the 4000 synthetic catalogs that we have generated 
are reported in Fig. [12] and highlight the degeneration between M* and a parameters. By 
plotting and a as a function of the number counts of the survey (Fig. [13]) we can see that 
the slope of the luminosity function does not depend on the environment, while L* does. 
When the number of counts in a field is above the average, is larger, although the scatter 
at fixed number of counts is significant. 

To further quantify the effect of cosmic scatter on the luminosity function fitting, we 
consider the case where Lyman Break Galaxies are detected both in a large area deep survey, 
such as GOODS, as well as in a single, or few, pencil beams at a greater depth, such 
as the UDF and UDF follow-up fields. This combination of data appears ideal from the 
observational point of view as it allows us to constraint the break of the luminosity 
function using detections from the large area, shallower, survey, and the faint end slope using 
the deeper dataset. However, the fitting procedure employed may lead to artificial biases, 
especially when one tries to naively correct for the effects of cosmic variance by considering its 
effect on 0* alone. In fact, one might be tempted to re-normalize the luminosity function for 
the deeper fields by considering the number of dropouts galaxies detected at the same depth 



of the larger ar ea, fainter survey. Fo r example, this is what has been done by lBouwens et al. 



(120061 ) (see also lBouwens et al.ll2007l ). who obtained the luminosity function for i-dropouts by 
multiplying the UDF and UDF parallel fields counts by the factors 1.3 and 1.5 (respectively) 
in order to account for a deficit of bright detections with respect to the GOODS fields. 
This correction has two potential problems that may contribute to introducing an artificial 
steepening of the luminosity function and both problems are apparent from Fig. [9] First of 
all it is clear from the left panel of Fig. M that if one were to rescale the luminosity function 
of the deeper field, a relation of the form Nf t = r]N br would not be justified, as we show in 
Sec. 15.11 In fact, the best fitting linear relation for the counts in Fig. [9] is Nft = 4.92iVfo r + 47. 
This implies that a 50% deficit in bright counts with respect to the average would correspond 
on average to only a 20% deficit in faint counts and not to the 50% naive estimate of the 
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deficit. There is also a second, subtler effect introduced by a renormalization of the luminosity 
function based on the number counts: N^ r /Nf t is strongly correlated with jVj, r in underdense 
fields due to the lack of luminous galaxies, so a renormalization of the data based on matching 
the Nb r counts to a reference value introduces an artificial steepening of the faint end of the 
luminosity function. 

As a quantitative example, we consider a test case where the luminosity function is 
determined by GOODS-like data (one 10 x 16 arcmin 2 field) at the bright end and by one 
ACS field at the faint end. The average number counts per arcmin 2 that we adopt are 0.75 
fo r the bright end and 8.75 for the deeper field (to be consistent with the number densities 



in 



Bouwens et al.ll2006l ). This gives us an average of 120 bright objects in the large area field 



and of 100 objects down to the fainter limit of the UDF-like field. 

The luminosity function is then fitted by adopting three different methods: 



in 



(i) first an observed luminosity function is constructed from the synthetic catalogs using 
binned data (0.5 mag bins) by combining the data fro m the two surveys wit h no Large 
Scale Structure correction, as described in Sec 5.3 of iBouwens et al.l (120061 ). Then we 
fit the model luminosity function using a maximum likelihood approach to these binned 
data, that is we compute the theoretical expectation for the number of objects in each 
magnitude bin and then we maximize the likelihood under the assumption that the 
counts in each bin are Poisson distributed. 

(ii) maximum likelihood on binned data, as in point (i) above, but here the luminosity 
function determination includes a large scale correctio n a la Bouwens et al. (that is 
relying on the relation Nf t = r\ ■ Nb r .), as in Sec. 5.1 of IBouwens et al.l (120061 ). 



unbinned maximum likelihood modeling of the data with free normalization of the data 
between the deep and wide fields. The procedure is a straightforward generalization of 
what discussed above for the determination o f the luminosity function in a single field 
and proceeds as described in ISandage et al. Jl979h . where this method has been first 
applied to luminosity function fitting 2 ]. 



2 Here we would like t o stress that th e ISandage et al.l (|1979l ) procedure relies on unbinned data, so the 
STJ79 fit ting adopted by Bouwens et al. ( 2007b is not really an application of this method. Note also that 
Eq. A6 in IBouwens et al.l (|2007l ) for the probability distribution of the counts in each bin is not the Poisson 
one and the maximization of their likelihood returns a result equivalent to maximizing a Poisson probability 
distribution only in the limit of a perfect data-model match, that is when the number of observed objects in 
each bin is equal to the number of expected objects. This difference is likely to affect the confidence regions 
for the best fitting parameters. 
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The results from these different fitting methods are shown in Figs. [HI where we plot the 
contour levels of the distribution of the best fitting luminosity function parameters obtained 
from 600 different combinations of deep and wide fields. All the parameters are shown as a 
function of the total number counts in the deep field. The max imum likelihood method with 
free normalization between the two fields (ISandage et al.l 119791 ) has a one sigma uncertainty 
of ~ 0.15 on a and of ~ 0.3 on M*. The uncertainties are similar for when the fit is 



perfor med on binned data without using the large scale renormalization of iBouwens et al. 
(120061 ). but in this case the fixed normalization between the two fields introduces a bias 
in a, with underdense de ep fields leading to a shallower slope. Applying the environment 
renormalization following IBouwens et al.l (120061 ) does over-correct the problem with a larger 
uncertainty in a and M* (0.21 and 0.36 respectively) and a preference for steeper shapes 
of the luminosity function. This is an artifact as the faint end of the luminosity function 
is overestimated by the correction applied when the dee p field is lacking lumi nous galaxies. 
This behavior is also apparent from the fits performed in IBouwens et al.l (120071 ):. e.g. in their 
Table 6 th ey estimate for i - drop outs a = — 1.77±0.16 using a maximum likelihood approach 
similar to ISandage et al.l (119791 ) and a — 2.06 ± 0.20 using the large scale renormalization 
method. 



6. Conclusion 

In this paper, we present detailed estimates of the variance in the number counts of 
Lyman Break Galaxies for high redshift deep surveys and of the resulting impact on the 
determination of the galaxy luminosity function. The number counts distribution has been 
derived from collisionless, dark matter only cosmological simulations of structure formation 
assuming a M/L relation and has been compared with analytical estimates obtained from 
the two point correlation function of dark matter halos. Halos have been identified in the 
simulations snapshots, saved at high frequency (Az = 0.125 up to z = 10) and a pencil 
beam tracer Monte Carlo code has been used to construct synthetic catalogs of the halos 
above a minimum mass threshold within the selection window for Lyman Break Galaxies at 
different redshifts (corresponding to V,i,z and J dropouts for HST surveys). In addition we 
also consider JWST NIRCam F090W, F115W and F150W dropouts up to z = 15. 

By populating the dark matter halos with galaxies using different semi-analytical pre- 
scriptions, that include multiple halo occupation and different detection probabilities, we 
have shown that, to first order, the standard deviation of the number counts for a given 
dropout population depends mainly on the average value of number counts and on the ge- 
ometry of the pencil beam. This is because the average bias of the population varies little 
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as a function of the number density and is reassuring for the robustness of our results, as 
it means that the still uncertain details of high redshift galaxy formation are unlikely to 
significantly affect the amount of cosmic variance in deep surveys. 

The distribution of the number counts around its central value is highly skewed for low 
number counts, while it becomes progressively more symmetric as the average number of 
objects in the field of view increases. The ratio of the measured variance to the variance 
expected from Poisson noise is an increasing function of the average number of objects in 
the field. For the typical number counts of V and i-dropout in an ACS field of view at the 
UDF depth the one sigma fractional uncertainty is about three times that due to Poisson 
noise. This has a ma jor impact on t h e rari ty of overdensities of high redshift galaxies such 



as those reported by iMalhotra et all (120051 ). that while highly significant with respect to a 



Poisson statistics, turn out to be not uncommon when th e effect of clustering is taken into 



account (e.g. at the 1.5a level in the IMalhotra et al.ll2005l case). 



The geometry of the volume probed is fundamental in defining the number counts 
variance and a long and narrow beam has a lower variance with respect to that estimated 
from an equivalent spherical (or cubical) volume. This is because the long beam passes 
through many different environments, while a spherical volume may happen to sit right on 
top of extreme over-densities or under-densities. For example for i-dropouts an ACS like 
field of view of 6 x 6 x 320(Mpc/h) 3 has a fractional uncertainty of ~ 30% for 100 counts 
on average, while an equivalent cubic volume would have an uncertainty of ~ 50%. 

Number counts in nearby fields are significantly correlated: for i-dropouts, two adjacent 
ACS fields with 50 i-dropouts counts on average, have a linear correlation coefficient Tu n = 
0.41. This becomes ru n < 0.05 at angular separation of about 550 arcsec. This has two 
important consequences: (i) large area surveys with adjacent exposures such as GOODS 
are still affected by a non-negligible amount of cosmic variance, so that v r is of the order 
of « 20% for i-dropouts down to its faint detection limit and combining the two North 
and South fields, and (ii) HST parallel observations, separated by about 600 arcsec, such as 
those from the UDF and UDF follow-up programs, can be considered essentially independent 
fields. For independent fields the cosmic variance decreases as the square root of the number 
of fields (in fact the variance from independent variables sums up in quadrature). From the 
point of view of future observations this also implies that the currently existing six ultra deep 
NICMOS fields have a smaller total cosmic variance with respect to a future single WFC3 
deep field to comparable depth, despite the greater area of the latter. 

Using a simple mass-luminosity relation (Eq. E]) we also investigate the effects of cosmic 
variance on the determination of the galaxy luminosity function. The impact of cosmic 
variance is not limited on the normalization of the luminosity function, but extends also to 
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its shape. In fact, the luminosity function for under-dense regions appears to be steeper than 
for field and cluster environments. By fitting a Schechter function to synthetic catalogs of 
V-dropouts galaxies in a UDF-like survey we find that M* varies by about one magnitude 
from over-dense to under-dense fields, while the slope a remains approximately unchanged. 
An important caveat is that, as we have taken into account essentially only dark matter 
clustering in our modeling, our result could be changed if strong feedback effects due to 
baryon physics are important. 

This dependence of the luminosity function on the number counts of the field has impor- 
tant consequences when attempts to correct for a deficit of detections are made in dataset 
that combine a large area survey with a small, deeper area. In fact, an artificial steepening 
of the estimated luminosity function from binned data may arise when naive corrections 
to account for under-densities are used, such as a re-normalization of the faint end of the 
luminosity function in terms of the ratio of bright counts in the deep area of the survey with 
respect to the average value of bright counts over the whole survey area. 

Therefore to determine the luminosity function for such survey configurations the best 
approach appears to be the maximum likel ihood method applied to the unbinned data, as 



originally proposed by lSandage et al.l (I1979I ). The first step is to determine the shape (a and 
M*) of the Schechter function probability distribution, in case convolved with a detection 
probability kernel to take into account incompleteness and selection effects. This can be done 
by combining the likelihood of both faint and bright counts, but allowing the normalization 
to be a free parameter and to vary among the two samples. Next, 0* is estimated from 
the total number counts of the two samples, compared with the expectation from integration 
of the luminosity function between the relevant luminosity limits. This method appears to 
be relatively unbiased with respect to cosmic variance and represents an approach that does 
not require to quantify the relative normalization of the detections in the different fields 
considered. 

Finally we stress that the intrinsic uncertainty due to cosmic variance present while 
estimating the luminosity function parameters must be taken into account when claims are 
made on the redshift evolution of these parameters. For example, by combining one GOODS- 
like field with a single deep, UDF like field, cosmic variance introduces a la uncertainty in 
a of Aa m 0.15. This error is a systematic contribution that comes on top of any other 
contribution to the total error budget. 
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Fig. 1. — Probability distribution of the i-dropouts number counts for an ACS field with 
40.3 average counts, using different Halo Occupation Models and different simulations boxes: 
iV = 5f2 3 and N = 680 3 (HighRes). The cosmic variance for these runs is quantified in 
table [3l For reference the Poisson distribution for 40.3 counts on average is plotted with a 
dashed line. 
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Fig. 2. — Total fractional error v r of the number counts of dark matter halos in a cosmic 
volume of 1.152 • 10 4 (Mpc/h) 3 at z = 6.125 as a function of the average number counts 
for different shapes of the volume. The solid line is the measurement from our numerical 
simulations while the dashed line is the prediction using the analytic model described in 
Sec. 12.11 The quasi-cubic volume has the highest standard deviation, especially at large 
number counts, where the Poisson noise (green line) has a negligible contribution to the 
total fractional error (see also Fig. H]). 
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Fig. 3. — Total fractional uncertainty v r of the number counts estimated using model (dotted) 
and simulations (solid) for V (red) and i (blue) dropouts for one ACS field (in units of the 
average number of counts) versus the average number of counts per arcmin 2 . The Poisson 
noise associated to the counts is plotted for reference in green. 
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Fig. 4. — Total fractional error (v r ) in the number counts for V and i-dropouts in one ACS 
field in units of the Poisson noise (1/ ■sj {N}) versus the average number of counts ((N)) per 
field. As v r is much bigger than the corresponding Poisson noise, cosmic variance is the 
dominant source of uncertainty in the number counts. 
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Fig. 5. — Like in Fig. 0] (all curves from analytical modeling) but for z and J-dropouts con- 
sidering different fields of view: (i) NICMOS Camera 3 (solid), (ii) 6 independent NICMOS 
Camera 3 fields summed (short dashed), (hi) one WFC3 held (long dashed). Poisson noise 
for a single NICMOS Camera 3 and WFC3 held is shown in green. 
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Fig. 6. — Total fractional error for i-dropouts in the two GOODS fields (solid line: simula- 
tions; dashed line: model) versus the average number of counts per arcmin 2 . Poisson noise 
is plotted for reference in green. 
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Fig. 7. — Linear correlation coefficient ru n for the number counts of i-dropouts in two nearby 
ACS fields with an average of 50 detections per field as function of their separation. 
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Fig. 8. — Total fractional error v r for JWST NIRCam dropouts (filled points) and average 
number of detections (N) per field of view (open points) versus the average redshift of the 
pass-band filter. The total field of view is given by two 2.2 x 2.2arcmin 2 fields separated 
by 30 arcsec. The counts go down to a mass l imit equivalent to a d epth of 3 mag below 
M* for i-dropouts in a ACS field assuming the iBouwens et al.l (120061 ) luminosity function. 
For F150W and F115W dropouts v r is dominanted by Poisson noise, while for F090W v r is 
about twice the Poisson error associated to the estimated 10 counts per field. 
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Fig. 9. — Distribution of the of the total (N tot ) versus bright (N br ) i-dropouts counts (left 
panel) and of the Nb r /N to t ratio (right panel) simulated for one ACS field in function of 
the number of bright counts. The plots have been obtained using 4000 MC realizations. 
Confidence level contours are: red 68%, blue 95%, green 99%. The average number of bright 
counts per field is 1.0 arcmin~ 2 , while the average number of total counts is 8.6 arcmin 2 . 
From the right panel it is clear that the steepness of the luminosity function increases in 
under-dense regions. 
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Fig. 11. — Like in Fig. [9] but for a toy model where Nf t and iV& r are perfectly linearly 
correlated, apart from Poisson fluctuations in Nf t . This correlation leads to a luminosity 
function shape that is environment independent. 
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Fig. 12. — Best fitting Schechter function parameters for V-dropouts from one ACS field with 
200 average detections, simulated with 4000 realizations using our MC code. Parameters have 
been estimated using a Maximum Likelihood analysis. Confidence level contours are: red 
68%, blue 95%, green 99%. 
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Fig. 13. — Maximum Likelihood best fitting Schechter function parameters as function of 
the number counts in the field of view for V-dropouts from one ACS field with an average 
of 200 detections, simulated with 4000 MC realizations. Left panel: a\ right panel: M*. 
Confidence level contours are: red 68%, blue 95%, green 99%. 




Fig. 14. — Best fitting parameters a (left) and M* (right) from unbinned i-dropout data (600 
synthetic catalogs) that are a combination of one GOODS like field (with 120 objects on 
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Table 1: Summary of N-body simulations 



N 


L box [Mpc/h] 


0"8 


512 3 


100 


0.75 


512 3 


160 


0.75 


680 3 


128 


0.9 



Note. — Summary of the N-body simulations of structure formation. The first column reports the number 
of particles N used, the second the box edge size (Lb ox ) and the third the normalization as of the amplitude 
of the power spectrum of density fluctuations. 



Table 2: Pencil Beam Properties 



ID 


Angular Size [(") 2 ] 




Z"max 


Comoving Size [(Mpc/h) 3 ] 


Sep. 


ACS v-drop 


205 x 205 


4.6 


5.7 


5.7 x 5.7 x 420 




ACS i-drop 


205 x 205 


5.65 


6.7 


6.0 x 6.0 x 320 




GOODS i-drop 


600 x 960 


5.65 


6.7 


17.6 x 28.1 x 320 




NIC3 z-drop 


51 x 51 


7.0 


8.5 


1.6 x 1.6 x 342 




WFC3 z-drop 


125 x 137 


7.0 


8.5 


3.9 x 4.3 x 342 




NIC3 J-drop 


51 x 51 


8.5 


10.0 


1.7 x 1.7 x 269 




WFC3 J-drop 


125 x 137 


8.5 


10.0 


4.1 x 4.5 x 269 




JW_F090W-drop 


2 x (130 x 130) 


9.0 


10.8 


2 x (4.5 x 4.5 x 295) 


30" = 1.0 Mpc/h 


JW_F115W-drop 


2 x (130 x 130) 


11.0 


12.7 


2 x (4.6 x 4.6 x 217) 


30" = 1.0 Mpc/h 


JW_F090W-drop 


2 x (130 x 130) 


12.9 


15.1 


2 x (4.8 x 4.8 x 223) 


30" = 1.1 Mpc/h 



Note. - 



Assumed properties for the Pencil Beam Fields used throughout this paper. 
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Table 3: Cosmic variance for the models in Fig. [T] 



ID 




v r 


foN = 


0.25, HighRes 


0.32 


foN = 


0.5 


0.35 


foN = 


1 


0.39 


foN = 


1, HOD 


0.42 


foN = 


1, HighRes 


0.40 


foN = 


1, HighRes, MagCut 


0.42 



Note. — Cosmic variance v r for i-dropouts (40 counts on average) in one ACS field for the different models 
presented in Fig. [TJ 



